OcrV1, Main, Exploration, bibRecord, 001105

Bangla/English Script Identification Based on Analysis of Connected Component Profiles

Identifieur interne : 001105 ( Main/Exploration ); précédent : 001104; suivant : 001106

Bangla/English Script Identification Based on Analysis of Connected Component Profiles

Auteurs : Lijun Zhou [République populaire de Chine] ; Yue Lu [République populaire de Chine] ; Lim Tan [Singapour]

Source :

Lecture Notes in Computer Science [ 0302-9743 ] ; 2006.

RBID : ISTEX:AC2AD06559B5D1AAFC160E94B92A3B368A31357D

Abstract

Abstract: Script identification is required for a multilingual OCR system. In this paper, we present a novel and efficient technique for Bangla/English script identification with applications to the destination address block of Bangladesh envelope images. The proposed approach is based upon the analysis of connected component profiles extracted from the destination address block images, however, it does not place any emphasis on the information provided by individual characters themselves and does not require any character/line segmentation. Experimental results demonstrate that the proposed technique is capable of identifying Bangla/English scripts on the real Bangladesh postal images.

Url:

https://api.istex.fr/document/AC2AD06559B5D1AAFC160E94B92A3B368A31357D/fulltext/pdf

DOI: 10.1007/11669487_22

Affiliations:

Links toward previous steps (curation, corpus...)

to stream Istex, to step Corpus: 001B47
to stream Istex, to step Curation: 001A37
to stream Istex, to step Checkpoint: 000A78
to stream Main, to step Merge: 001122
to stream Main, to step Curation: 001105

Le document en format XML

<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Bangla/English Script Identification Based on Analysis of Connected Component Profiles</title>
<author><name sortKey="Zhou, Lijun" sort="Zhou, Lijun" uniqKey="Zhou L" first="Lijun" last="Zhou">Lijun Zhou</name>
</author>
<author><name sortKey="Lu, Yue" sort="Lu, Yue" uniqKey="Lu Y" first="Yue" last="Lu">Yue Lu</name>
</author>
<author><name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:AC2AD06559B5D1AAFC160E94B92A3B368A31357D</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/11669487_22</idno>
<idno type="url">https://api.istex.fr/document/AC2AD06559B5D1AAFC160E94B92A3B368A31357D/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001B47</idno>
<idno type="wicri:Area/Istex/Curation">001A37</idno>
<idno type="wicri:Area/Istex/Checkpoint">000A78</idno>
<idno type="wicri:doubleKey">0302-9743:2006:Zhou L:bangla:english:script</idno>
<idno type="wicri:Area/Main/Merge">001122</idno>
<idno type="wicri:Area/Main/Curation">001105</idno>
<idno type="wicri:Area/Main/Exploration">001105</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Bangla/English Script Identification Based on Analysis of Connected Component Profiles</title>
<author><name sortKey="Zhou, Lijun" sort="Zhou, Lijun" uniqKey="Zhou L" first="Lijun" last="Zhou">Lijun Zhou</name>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Department of Computer Science and Technology, East China Normal University, 200062, Shanghai</wicri:regionArea>
<wicri:noRegion>Shanghai</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Lu, Yue" sort="Lu, Yue" uniqKey="Lu Y" first="Yue" last="Lu">Yue Lu</name>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Department of Computer Science and Technology, East China Normal University, 200062, Shanghai</wicri:regionArea>
<wicri:noRegion>Shanghai</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Shanghai Research Institute of Postal Science, China State Post Bureau, 200062, Shanghai</wicri:regionArea>
<wicri:noRegion>Shanghai</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<affiliation wicri:level="4"><country xml:lang="fr">Singapour</country>
<wicri:regionArea>Department of Computer Science, School of Computing, National University of Singapore, Kent Ridge, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2006</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">AC2AD06559B5D1AAFC160E94B92A3B368A31357D</idno>
<idno type="DOI">10.1007/11669487_22</idno>
<idno type="ChapterID">22</idno>
<idno type="ChapterID">Chap22</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Script identification is required for a multilingual OCR system. In this paper, we present a novel and efficient technique for Bangla/English script identification with applications to the destination address block of Bangladesh envelope images. The proposed approach is based upon the analysis of connected component profiles extracted from the destination address block images, however, it does not place any emphasis on the information provided by individual characters themselves and does not require any character/line segmentation. Experimental results demonstrate that the proposed technique is capable of identifying Bangla/English scripts on the real Bangladesh postal images.</div>
</front>
</TEI>
<affiliations><list><country><li>République populaire de Chine</li>
<li>Singapour</li>
</country>
<orgName><li>Université nationale de Singapour</li>
</orgName>
</list>
<tree><country name="République populaire de Chine"><noRegion><name sortKey="Zhou, Lijun" sort="Zhou, Lijun" uniqKey="Zhou L" first="Lijun" last="Zhou">Lijun Zhou</name>
</noRegion>
<name sortKey="Lu, Yue" sort="Lu, Yue" uniqKey="Lu Y" first="Yue" last="Lu">Yue Lu</name>
<name sortKey="Lu, Yue" sort="Lu, Yue" uniqKey="Lu Y" first="Yue" last="Lu">Yue Lu</name>
</country>
<country name="Singapour"><noRegion><name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001105 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001105 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:AC2AD06559B5D1AAFC160E94B92A3B368A31357D
   |texte=   Bangla/English Script Identification Based on Analysis of Connected Component Profiles
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	Serveur d'exploration sur l'OCR
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur l'OCR

Bangla/English Script Identification Based on Analysis of Connected Component Profiles

Bangla/English Script Identification Based on Analysis of Connected Component Profiles

Source :

Abstract

Links toward previous steps (curation, corpus...)

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri